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DETAILED ACTION 

1. Formal Matters 

A. The Election dated 7/6/04 has been entered into the record. 

B. Claims 1-20 were pending and were subject to restriction in the Office Action mailed 5/3/04. 
Applicants elected Group I, claims 1-6, 9-15 and 19, with traverse. The traversal will be 
discussed below. Claims 7, 8, 16-18 and 20 have been withdrawn as being drawn to a non-elected 
invention. Therefore, claims 1-6, 9-15 and 19 are the subject of this Office Action. 

2. Answer to Traversal 

A. Applicants argue that SEQ ID N0:9, which is the polynucleotide encoding the polypeptide of 
SEQ ID NO:l should be searched. This argument is persuasive. Therefore, SEQ JD NO:l and 9 will be 
searched. 

5. Specification 

A. The title of the invention is not descriptive. A new title is required that is clearly indicative of the 
invention to which the claims are directed. 

The following title is suggested: G protein-coupled receptor proteins, encoding nucleic acids and 
methods of using the proteins. 

4* Claim Objections 

A. Claims 1-6, 9-15 and 19 are objected to since they recite non-elected SEQ ID NOs. 

5. Claim Rejections - 35 USC § 101 

35 U.S.C. 101 reads as follows: 

Whoever invents or discovers any new and useful process, machine, manufacture, or composition of matter, or any new 
and useful improvement thereof, may obtain a patent therefor, subject to the conditions and requirements of this title. 

A. Claims 1-6, 9-15 and 19 are rejected under 35 U.S.C. 101 because the claimed invention is not 
supported by a specific, substantial and credible asserted utility or a well estabHshed utility. These claims 
are directed to a protein of SEQ ID NO:l, its encoding polynucleotide of SEQ ID N0:9 and methods of 
treating disorders. However, the invention encompassed by these claims has no apparent or disclosed 
patentable utility. This rejection is consistent with the current utility guidelines, published 1/5/01, 66 PR 
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1092. The instant application has provided a description of an isolated protein. However, the instant 
application does not disclose a specific and substantial biological role of this protein or its significance. 

However, it is clear from the instant specification that the claimed receptor is what is termed an 
"orphan receptor" in the art. The instant application does not disclose the biological role of the claimed 
protein or its significance in relation to any disease state. Applicants disclose in the specification that the 
claimed receptor is homologous to human epididymis-specific seven transmembrane receptor HE6. 
However, homology alone is not sufficient. There is little doubt that, after complete characterization, this 
protein will probably be found to have a patentable utility. This further characterization, however, is part 
of the act of invention and, until it has been undertaken, Applicants' claimed invention is incomplete. 

The instant situation is directly analogous to that of which was addressed in Brenner v. Manson, 
148 U.S.P.Q. 689 (Sus, Ct, 1966), in which a novel compound which was structurally analogous to other 
compounds which were known to possess anticancer activity was alleged to be potentially useful as an 
antitumor agent in the absence of evidence supporting this utility. The court expressed the opinion that all 
chemical compounds are "useful" to the chemical arts when this term is given its broadest interpretation. 
However, the court held that this broad interpretation was not the intended definition of "useful" as it 
appears in 35 U.S.C. 101, which required that an invention must have either an immediate obvious or 
fully disclosed "real-world" utility. The court held that: 

"The basic quid pro quo contemplated by the Constitution and the Congress for granting a patent 
monopoly is the benefit derived by the public from an invention with substantial utility," "[ujnless and until a 
process is refined and developed to this point - where specific benefit exists in currently available form - there 
is insufficient justification for permitting an applicant to engross what may prove to be a broad field," and "a 
patent is not a hunting license," "[i]t is not a reward for the search, but compensation for its successful 
conclusion." 

The specification discloses that the polypeptide of the invention has sequence similarity to a 
known human epididymis-specific seven transmembrane receptor HE6. Based on the structural similarity, 
the specification asserts that the newly disclosed SEQ ID NO:l and 9 have similar activities. The 
assertion that the disclosed proteins have biological activities similar to known receptors cannot be 
accepted in the absence of supporting evidence, because, generally, the art acknowledges that function 
cannot be predicted based solely on structural similarity to a protein found in the sequence databases. For 
example, Skolnick et al. (2000, Trends in Biotech. 18:34-39) state that knowing the protein structure by 
itself is insufficient to annotate a number of functional classes, and is also insufficient for annotating the 
specific details of protein function (see Box 2, p. 36). Similarly, Bork (2000, Genome Research 10:398- 
400) states that the error rate of functional annotations in the sequence database is considerable, making it 
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even more difficult to infer correct function from a structural comparison of a new sequence with a 
sequence database (see especially p. 399). Such concerns are also echoed by Doerks et al. (1998, Trends 
in Genetics 14:248-250) who state that (1) functional information is only partially annotated in the 
database, ignoring multi functionahty, resulting in underpredictions of functionality of a new protein and 
(2) overpredictions of functionahty occur because structural similarity often does not necessarily coincide 
with functional similarity. Smith et al. (1997, Nature Biotechnology 15:1222-1223) remark that there are 
numerous cases in which proteins having very different functions share structural similarity due to 
evolution from a common ancestral gene. 

Brenner (1999, Trends in Genetics 15:132-133) argues that accurate inference of function from 
homology must be a difficult problem since, assuming there are only about 1000 major gene 
superfamilies in nature, then most homologs must have different molecular and cellular functions. 
Finally, Bork et al. (1996, Trends in Genetics 12:425-427) add that the software robots that assign 
functions to new proteins often assign a function to a whole new protein based on structural similarity of a 
small domain of the new protein to a small domain of a known protein. Such questionable interpretations 
are written into the sequence database and are then considered facts. 

Therefore, based on the discussions above concerning the specific examples of structurally 
similar proteins that have different functions, along with the art's recognition that one cannot rely upon 
structural similarity alone to determine functionality, the specification fails to teach the skilled artisan the 
utility of the claimed polypeptide of SEQ ID N0:1 and the polynucleotides of SEQ ID N0:9 which are 
only known to be homologous to a known receptor. Therefore, the instant claims are drawn to a 
polynucleotide encoding a protein which has a yet undetermined function or biological significance. 
There is no actual and specific significance which can be attributed to said protein identified in the 
specification, or any association with a disease state. For this reason, the instant invention is incomplete. 
Li the absence of a knowledge of the natural ligands or biological significance of this protein, there is no 
immediately obvious patentable use for it. To employ a protein of the instant invention in the 
identification of substances which bind to and/or mediate activity of the said receptor is clearly to use it as 
the object of further research which has been determined by the courts to be a non-patentable utility. 
Since the instant specification does not disclose a "real-world" use for said protein then the claimed 
invention is incomplete and, therefore, does not meet the requirements of 35 U,S,C. 101 as being useful. 

Furthermore, since the nucleic acid and protein of the invention are not supported by a 
specific and substantial asserted utility or a well established utility, the vector, host cell as well as 
methods for producing and using the claimed polypeptide also lack utility. 
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6. Claim Rejections - 35 USC § 112, first paragraph - scope of enablement 

The following is a quotation of the first paragraph of 35 U.S.C. 1 12: 

The specification shall contain a written description of the invention, and of the manner and process of making 
and using it, in such flill, clear, concise, and exact terms as to enable any person skilled in the art to which it 
pertains, or with which it is most nearly connected, to make and use the same and shall set forth the best mode 
contemplated by the inventor of carrying out his invention. 

A. Claims 1-6, 9-15 and 19 are rejected under 35 U.S.C. 112, first paragraph, as failing to adequately 
teach how to use the instant invention. Specifically, since the claimed invention is not supported by a 
specific, substantial and credible asserted utility or a well established utility for the reasons set forth 
above, one skilled in the art clearly would not know how to use the claimed invention. 

B. Furthermore, even if claim 1-6, 9-15 and 19 possessed utility under 35 USC 101, they would still 
be rejected under 35 U.S.C. 1 12, first paragraph, because the specification, while then being enabling for 
the protein of SEQ ID N0:1 and the polynucleotide of SEQ ID N0:9, does not reasonably provide 
enablement for proteins which are at least "70% or 90%" identical to SEQ ID NO:l or 9, "fragments" 
thereof, or which "hybridize" to SEQ ID NO:9, Furthermore, the specification is not enabled for "host 
cells" which are not "isolated," for "pharmaceutical compositions" or for "methods of treating 
disorders" with these compositions. The specification does not enable any person skilled in the art to 
which it pertains, or with which it is most nearly connected, to make the invention commensurate in scope 
with these claims. 

hi hi re Wands, 8USPQ2d, 1400 (CAFC 1988) page 1404, the factors to be considered in 
determining whether a disclosure would require undue experimentation include (I) the quantity of 
experimentation necessary, (2) the amount of direction or guidance presented, (3) the presence or absence 
of working examples, (4) the nature of the invention, (5) the state of the prior art, (6) the relative skill of 
those in the art, (7) the predictability or unpredictability of the art, and (8) the breadth of the claims. 

First, the breadth of the claims is excessive with regard to claiming all proteins and 
polynucleotides which are "at least 70% or 90% identical" to SEQ ID N0:1 or 9, respectively, or those 
which "hybridize" under stringent conditions to SEQ ID NO:9. Nucleic acid molecules which are "at least 
70% or 90% identical" to SEQ ID NO:9," which "hybridize" to SEQ ID NO:9, or which comprise 
fi-agments of SEQ ID NO: 9 would have one or more nucleic acid substitutions, deletions, insertions 
and/or additions to the polynucleotide of SEQ ID N0:9. Similarly, proteins which are "at least 70% or 
90% identical" to the protein of SEQ ID N0:1, or comprise fragments thereof, would encode for a protein 
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with one or more amino acid substitutions, deletions, insertions and/or additions to the protein of SEQ ID 
NO:l. 

Applicants provide no guidance or working examples of nucleic acid molecules which hybridize 
to SEQ ID NO:l, or of proteins which are at least 70% or 90% identical to SEQ ID N0:1, or 
polynucleotides which are at least 70% or 90% identical to SEQ ID N0:9, nor do they provide a function 
of these nucleic acid molecules, or proteins. Furthermore, fragments can be as few as one amino acid, or 
nucleic acid. Applicants have provided no guidance as to what critical residues are required to maintain 
the functional characteristics of any protein other than that of the full-length of SEQ ID N0:1 or the 
polynucleotide of SEQ ID NO:9. Furthermore, it is not predictable to one of ordinary skill in the art how 
to make a functional protein which is less than 100% identical to that of SEQ ID N0:1, or the 
polynucleotide of SEQ ID N0:9. 

In addition, Applicants are not enabled for host cells which are not "isolated." As stand, claim 13 
reads on gene-therapy since the cells transfected with the DNA could be in, for example, a human host. 
Claims 15 and 19 recite "pharmaceutical compositions." However, Applicants have not provided any 
guidance and working examples of the use of the protein of the invention to treat any disorder, nor are 
Applicants enabled for the scope of claim 19 since there are no examples of any disorders to be treated. 

In summary, the breadth of the claims is excessive with regard to Applicants claiming all proteins 
and polynucleotides which are at least "70% or 90%" identical to SEQ ID N0:1 or 9, "fragments" 
thereof, or polynucleotides which "hybridize" to SEQ ID N0:9. There is also a lack of guidance and 
working examples of these nucleic acid molecules and proteins as well as which residues are critical for 
protein function. Similarly, the specification does not enable host cells which are not isolated, 
pharmaceutical compositions or methods of treating any claimed disorder. These factors, along with the 
lack of predictability to one of ordinary skill in the art as to how to make a functional proteins and 
polynucleotides other that that of SEQ ID NO: 1 and 9, or how to use pharmaceutical compositions to treat 
the claimed disorders, leads the Examiner to hold that undue experimentation is necessary to practice the 
invention as claimed. 
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7. Claim Rejections - 35 USC § 11 2, first paragraph - written description 

A. Claims 1-6, 9-15 and 19 are rejected under 35 U.S.C. 112, first paragraph, as containing subject 
matter which was not described in the specification in such a way as to reasonably convey to one skilled 
in the relevant art that the inventor(s), at the time the application was filed, had possession of the claimed 
invention. 

These are genus claims. Nucleic acid molecules which are "at least 70% or 90% identical" to 
SEQ ID N0:9," which "hybridize" to SEQ ID NO:9, or which comprise "fragments" of SEQ ID NO: 9 
would have one or more nucleic acid substitutions, deletions, insertions and/or additions to the 
polynucleotide of SEQ ID NO:9. Similarly, proteins which are "at least 70% or 90% identical" to the 
protein of SEQ ID N0:1, or comprise fragments thereof, would encode for a protein with one or more 
amino acid substitutions, deletions, insertions and/or additions to the protein of SEQ ID NO: 1 . 

The specification and claims do not indicate what distinguishing attributes are shared by the 
members of the genus. Thus the scope of the claims includes numerous structural variants, and the genus 
is highly variant because a significant number of structural differences between genus members is 
permitted. The specification and claims do not provide any guidance as to what changes should be made. 
Structural features that could distinguish compounds in the genus fi:-om others in the nucleic acid or 
protein class are missing from the disclosure. No common structural attributes identify the members of 
the genus. The general knowledge and level of skill in the art do not supplement the omitted description 
because specific, not general, guidance is what is needed. Since the disclosure fails to describe the 
common attributes or characteristics that identify members of the genus, and because the genus is highly 
variant, SEQ ID NO:l and 9, or molecules which hybridize to the polynucleotides of SEQ ID N0:9 
(which could be at least thousands of molecules) are insufficient to describe the genus. One of skill in the 
art would reasonable conclude that the disclosure fails to provide a representative number of species to 
describe the genus. Thus, Applicant was not in possession of the claimed genus at the time the invention 
was made. 

This rejection also pertains to the claimed "pharmaceutical compositions" and "methods of 
treating disorders." The specification does not provide adequate written description of "pharmaceutical 
compositions" to treat any disorder, nor does it provide written description of any disorders to be treated 
with these compositions. 
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8, Claim Rejections - 35 USC§ 112, second paragraph 

The following is a quotation of the second paragraph of 35 U.S.C. 1 12: 

The specification shall conclude with one or more claims particularly pointing out and distinctly claiming the 
subject matter which the applicant regards as his invention. 

A. Claim 5 is vague and indefinite since the claim recites "stringent conditions." It is not known 
what these conditions are. Nucleic acid molecules which hybridize under conditions of "low" stringency 
would not necessarily hybridize under conditions of "high" stringency. Furthermore, not all conditions of 
"high" or "low" stringency, for example, are the same. Therefore, it is required that Applicants amend the 
claims to recite the exact hybridization conditions without using indefinite phrases such as "for example'' 
without adding new matter. 

B. Claim 19 is rejected under 35 U.S.C. 112, second paragraph, as being incomplete for omitting 
essential steps, such omission amounting to a gap between the steps. See MPEP § 2172.01. The omitted 
steps are: a conclusion step demonstrating when the claimed method has been performed (i.e. when the 
objective of the claim has been reached). The claim recites a method of treating a disorder. However, 
there is no steps which determine, or identify, when the claimed disorder has been treated. 

A Claim Rejections - 35 USC § 102 

The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the basis 
for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(a) the invention was known or used by others in this country, or patented or described in a printed publication in this 
or a foreign country, before the invention thereof by the applicant for a patent. 

A. Claims 1-6, 9-15 and 19 are rejected under 35 U.S.C. 102(a) as being anticipated by Wood et al. 
(WO 99/46281). The claims recite a polypeptide of SEQ ED N0:1, fragments thereof, as well as encoding 
polynucleotides, or fragments thereof. The claims also recite vectors, host cells, methods of making 
polypeptide, pharmaceutical compositions and methods of treating a disorder. Wood meet these 
limitations (Abstract; p296-299; Sequence Comparisons A, C, D). They teach a protein and 
polynucleotide which are 100% identical to SEQ ID NO:l and 9 of the present invention. The artisan 
would immediately envision pharmaceutical compositions as well as methods of treating diseases 
(Examples 108 and 109 of page 306; lines 25-30 of page 132). 
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B. Claims 1-6 and 9-11 are rejected under 35 U.S.C. 102(a) as being anticipated by Liu et al. 
(Genomics 1999). Liu teach a polynucleotide encoding a protein (or fragment thereof) which is 99.9% 
identical to that of the present invention (Sequence Comparison B). This polynucleotide would be 
expected to hybridize to SEQ ID NO:9. 

C. Claims 1-6, 9-11 are rejected under 35 U.S.C. 102(e) as being anticipated by US Patent 
6,183,968. The patent discloses a polynucleotide which is 80% identical to that of the present invention 
(Sequence Comparison E). The artisan would immediately envision that this polynucleotide encodes a 
fragment of SEQ ID NO: 1 . 

7ft Claim Rejections - 35 USC § 103 

The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all obviousness 

rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

A. Claims 12-15 are rejected under 35 U.S.C. 103(a) as being unpatentable over either Liu or US 
Patent 6,183,968 each in view of Wood et al. The claims of the present invention as well as the teachings 
of all cited references are seen in the above rejection under 35 USC 102. Neither Liu, nor '968 teach 
vectors, host cells, or methods of making protein. However, Wood do teach these limitations. 

It would have been obvious to one of ordinary skill in the art at the time the invention was made 
to have modified the invention of Wood et al. by substituting a cDNA in the polycloning region of the 
vector with the polynucleotide either Liu or '968 for the purpose of transfecting a host cell as taught by 
Wood. One of ordinary skill in the art would have been motivated to make this substitution in order to 
express the protein encoded by the introduced DNA in a host cell to perform ligand binding and 
functional assays. There would have been a reasonable expectation of success for a person of ordinary 
skill in the art to make this invention since these techniques are widely used in the art and are highly 
successful. The present invention, therefore, is prima facia obvious over the above references in the 
absence of evidence to the contrary. 
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IL Conclusion 



A. 



No claim is allowable 



Advisory information 

Any inquiry concemmg this communication or earlier communications from the examiner should 
be directed to Robert Landsman whose telephone number is (571) 272-0888. The examiner can normally 
be reached on Monday - Friday from 8:00 AM to 5:00 PM (Eastern time) and alternate Fridays from 8:00 
AM to 5:00 PM (Eastern time). 

If attempts to reach the examiner by telephone are unsuccessful, the Examiner's supervisor, 
Brenda Brumback, can be reached on (571) 272-0961. 

Official papers filed by fax should be directed to (703) 872-9306. Fax draft or informal 
communications with the examiner should be directed to (571) 273-0888. 

Any inquiry of a general nature or relating to the status of this application or proceeding should 
be directed to the Group receptionist whose telephone number is (571) 272-0700. 

Robert Landsman, Ph.D. 
Patent Examiner 

Group 1600 /—V 
July 16, 2004 / / 
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ID AAY41765 Standard; protein; 693 AA. 
XX 

DT 07-DEC-1999 (first entry) 
XX 

DE Human PRO1083 protein sequence. 
XX 

PN W09946281-A2. 

PD 16-SEP-1999. 

PF 08-MAR-1999; 99WO-US005028 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Wood WI, Goddard A, Gurney A, Yuan J, Baker KP, Chen J; 
XX 

DR WPI; 1999-551358/46. 

DR N-PSDB; AAZ34292. 
XX 

SQ Sequence 693 AA; 

Query Match 100.0%; Score 3604; DB 2; Length 693; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 693; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MTPQSLLQTTLFLLSLLFLVQGAHGRGHREDFRFCSQRNQTHRSSLHYKPTPDLRISIEN 60 

IMIIMIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIMMIIIIIMIIIIIIIIII 

Db 1 MTPQSLLQTTLFLLSLLFLVQGAHGRGHREDFRFCSQRNQTHRSSLHYKPTPDLRISIEN 6 0 

Qy 61 SEEALTVHAPFPAAHPASRSFPDPRGLYHFCLYWNRHAGRLHLLYGKRDFLLSDKASSLL 120 

IIIIIIMIIMIIIIIIIIIIMIIIIIMMIIIIIIIIIIIIIIIIIIMIIIMII 

Db 61 SEEALTVHAPFPAAHPASRSFPDPRGLYHFCLYWNRHAGRLHLLYGKRDFLLSDKASSLL 12 0 

Qy 121 CFQHQEESLAQGPPLLATSVTSWWSPQNISLPSAASFTFSFHSPPHTAAHNASVDMCELK 180 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIMII 
Db 121 CFQHQEESLAQGPPLLATSVTSWWSPQNISLPSAASFTFSFHSPPHTAAHNASVDMCELK 180 

Qy 181 RDLQLLSQFLKHPQKASRRPSAAPASQQLQSLESKLTSVRFMGDMVSFEEDRINATVWKL 24 0 

IIIIIMIMIIIIIMIIMIMIIIMIIIIIIIIIIIIIIMIIMIIIIIMIUI 

Db 181 RDLQLLSQFLKHPQKASRRPSAAPASQQLQSLESKLTSVRFMGDMVSFEEDRINATVWKL 240 

Qy 241 QPTAGLQDLHIHSRQEEEQSEIMEYSVLLPRTLFQRTKGRSGEAEKRLLLVDFSSQALFQ 300 ^ 

IMIIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIIMIIIIIIIII 

Db 241 QPTAGLQDLHIHSRQEEEQSEIMEYSVLLPRTLFQRTKGRSGEAEKRLLLVDFSSQALFQ 300 

Qy 3 01 DKNSSQVLGEKVLGIWQNTKVANLTEPWLTFQHQLQPKNVTLQCVFWVEDPTLSSPGH 360 

IIIIIIIIIIIIIMIIIMIIIIIMIIMIIIIIIIIIIIilllllMIIIIIMIII 

Db 301 DKNSSQVLGEKVLGIWQNTKVANLTEPWLTFQHQLQPKNVTLQCVFWVEDPTLSSPGH 360 

Qy 361 WSSAGCETVRRETQTSCFCNHLTYFAVLMVSSVEVDAVHKHYLSLLSYVGCWSALACLV 420 

IIIIIIMIIIIIIIIilllllllllllllllllllMMIIIIIIIIIIIIIIIIIIII 

Db 3 61 WSSAGCETVRRETQTSCFCNHLTYFAVLMVSSVEVDAVHKHYLSLLSYVGCWSALACLV 420 

Qy 421 TIAAYLCSRVPLPCRRKPRDYTIKVHMNLLLAVFLLDTSFLLSEPVALTGSEAGCRASAI 480 

IIIIIIMIIIIMIIIMMIIMMIIMIIIIIIIIIMMIIIMIIIIIIMIM 

Db 421 TIAAYLCSRVPLPCRRKPRDYTIKVHMNLLLAVFLLDTSFLLSEPVALTGSEAGCRASAI 480 



Qy 481 FLHFSLLTCLSWMGLEGYNLYRLWEVFGTYVPGYLLKLSAMGWGFPIFLVTLVALVDVD 54 0 

IIIIIIMIIIIIIIIIIIIIIMIMIIIIIIIIMMIIIMIIIIIIIIIMMIM 

Db 481 FLHFSLLTCLSWMGLEGYNLYRLWEVFGTYVPGYLLKLSAMGWGFPIFLVTLVALVDVD 54 0 



Qy 

Db 



541 NYGPIILAVHRTPEGVIYPSMCWIRDSLVSYITNLGLFSLVFLFNMAMLATMWQILRLR 600 

MIIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIII 

541 NYGPIILAVHRTPEGVIYPSMCWIRDSLVSYITNLGLFSLVFLFNMAMLATMWQILRLR 600 



Qy 601 PHTQKWSHVLTLLGLSLVLGLPWTU^IFFSFASGTFQLWLYLFSIITSFQGFLIFIWYWS 660 

IIIIIIIIIMIIMIIIIIIIIIIIIMIIIIIIIIIIIMIIMIIIIIMIIIIIII 

Db 601 PHTQKWSHVLTLLGLSLVLGLPWALIFFSFASGTFQLWLYLFSIITSFQGFLIFIWYWS 660 

Qy 661 MRLQARGGPSPLKSNSDSARLPISSGSTSSSRI 693 

IIIIMIIIIIIIIIIIIIIMIIIIIIIIIII 
Db 661 MRLQARGGPSPLKSNSDSARLPISSGSTSSSRI 693 
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0 
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0 

0 



US-09-806-537A-1 (1-693) x AF106858 (1-2822) 



Qy 1 MetThrProGlnSerLeuLeuGlnThrThrLeuPheLeuLeuSerLeuLeuPheLeuVal 20 

IMIMMIIMMIMIIMMIMIIMMIIIIIIIMIIIIMIIIIIIIIIIIM 

Db 163 ATGACTCCCCAGTCGCTGCTGCAGACGACACTGTTCCTGCTGAGTCTGCTCTTCCTGGTC 222 

Qy 21 GlnGlyAlaHisGlyArgGlyHisArgGluAspPheArgPheCysSerGlnArgAsnGln 40 

IIIIIIIIIINIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIMIIIIIIIII 

Db 223 CAAGGTGCCCACGGCAGGGGCCACAGGGAAGACTTTCGCTTCTGCAGCCAGCGGAACCAG 282 



Qy 


41 


ThrHisArgSerSerLeuHisTyrLysProThrProAspLeuArglleSerlleGluAsn 


60 


Db 


283 


IMIIIIIIIMIIIIMIIIIIIIIIIIIIIMIIIIMIIIIIIIIIIIMIIIIIN 

ACACACAGGAGCAGCCTCCACTACAAACCCACACCAGACCTGCGCATCTCCATCGAGAAC 


342 


Qy 


61 


SerGluGluAlaLeuThrValHisAlaProPheProAlaAlaHisProAlaSerArgSer 


80 


Db 


343 


IMMMIIMIMIIMIIMIIMIIIIIIIIIIIIIMIIMMIIIIIMIIIMI 

TCCGAAGAGGCCCTCACAGTCCATGCCCCTTTCCCTGCAGCCCACCCTGCTTCCCGATCC 


402 


Qy 


81 


PheProAspProArgGlyLeuTyrHisPheCysLeuTyrTrpAsnArgHisAlaGlyArg 


100 


Db 


403 


IMIIIIMIMMMIMMIIIIIIIIIMMIIIIMIIIIIIIIIIIMIIMMI 

TTCCCTGACCCCAGGGGCCTCTACCACTTCTGCCTCTACTGGAACCGACATGCTGGGAGA 


462 



Qy 101 LeuHisLeuLeuTyrGlyLysArgAspPheLeuLeuSerAspLysAlaSerSerLeuLeu 120 

MIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII 

Db 463 TTACATCTTCTCTATGGCAAGCGTGACTTCTTGCTGAGTGACAAAGCCTCTAGCCTCCTC 522 

Qy 121 CysPheGlnHisGlnGluGluSerLeuAlaGlnGlyProProLeuLeuAlaThrSerVal 140 

1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 

Db 523 TGCTTCCAGCACCAGGAGGAGAGCCTGGCTCAGGGCCCCCCGCTGTTAGCCACTTCTGTC 582 

Qy 141 ThrSerTrpTrpSerProGlnAsnlleSerLeuProSerAlaAlaSerPheThrPheSer 160 

IIMIIIIIIIMMIIMIIIIIIIIMIMIIIIMIIIIIIIIIIIIIIIIIIIIII 

Db 583 ACCTCCTGGTGGAGCCCTCAGAACATCAGCCTGCCCAGTGCCGCCAGCTTCACCTTCTCC 642 

Qy 161 PheHisSerProProHisThrAlaAlaHisAsnAlaSerValAspMetCysGluLeuLys 180 

IIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 643 TTCCACAGTCCTCCCCACACGGCCGCTCACAATGCCTCGGTGGACATGTGCGAGCTCAAA 702 

Qy 181 ArgAspLeuGlnLeuLeuSerGlnPheLeuLysHisProGlnLysAlaSerArgArgPro 200 

IIIMIIIIMIIIIIIIIIIIIIIIIIIIIMIIMMIIIIMIIIIIIIIIIMIII 

Db 703 AGGGACCTCCAGCTGCTCAGCCAGTTCCTGAAGCATCCCCAGAAGGCCTCAAGGAGGCCC 762 

Qy 201 SerAlaAlaProAlaSerGlnGlnLeuGlnSerLeuGluSerLysLeuThrSerValArg 220 

IIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIIIIIII 

Db 763 TCGGCTGCCCCCGCCAGCCAGCAGTTGCAGAGCCTGGAGTCGAAACTGACCTCTGTGAGA 822 

Qy 221 PheMetGlyAspMetValSerPheGluGluAspArglleAsnAlaThrValTrpLysLeu 240 

MIMIIIIIIIIMMMMIMIIMIIIIIIIIIIIIMIMIIIMIIIIIIMM 

Db 823 TTCATGGGGGACATGGTGTCCTTCGAGGAGGACCGGATCAACGCCACGGTATGGAAGCTC 882 

Qy 241 GlnProThrAlaGlyLeuGlnAspLeuHisIleHisSerArgGlnGluGluGluGlnSer 260 

MMMIMIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIMIIIMIIIMMI 

Db 883 CAGCCCACAGCCGGCCTCCAGGACCTGCACATCCACTCCCGGCAGGAGGAGGAGCAGAGC 942 

Qy 261 GluIleMetGluTyrSerValLeuLeuProArgThrLeuPheGlnArgThrLysGlyArg 280 

IMMIIIIIIIMIMIIIIIIIMIIIIIIIIIMIIMIIIIIIIIIIMIMIMI 

Db 943 GAGATCATGGAGTACTCGGTGCTGCTGCCTCGAACACTCTTCCAGAGGACGAAAGGCCGG 1002 

Qy 281 SerGlyGluAlaGluLysArgLeuLeuLeuValAspPheSerSerGlnAlaLeuPheGln 300 

IIIMIIIIIIIIIMIIIIIIMIIMIIIIIIIIIMIIIMIIIIIIIMMIIIII 

Db 1003 AGCGGGGAGGCTGAGAAGAGACTCCTCCTGGTGGACTTCAGCAGCCAAGCCCTGTTCCAG 1062 

Qy 301 AspLysAsnSerSerGlnValLeuGlyGluLysValLeuGlylleValValGlnAsnThr 320 

IMMMIMMIIIIIIIIMIIIIIIMIMIMIIIIIIIIIIMMIMIIIMII 

Db 1063 GACAAGAATTCCAGCCAAGTCCTGGGTGAGAAGGTCTTGGGGATTGTGGTACAGAACACC 1122 

Qy 321 LysValAlaAsnLeuThrGluProValValLeuThrPheGlnHisGlnLeuGlnProLys 340 

IIIIIIIMMIMIIIIIIMIIIIIIMMIIIMIIMIIIIIIIIIMIIMIIII 

Db 1123 AAAGTAGCCAACCTCACGGAGCCCGTGGTGCTCACTTTCCAGCACCAGCTACAGCCGAAG 1182 

Qy 341 AsnValThrLeuGlnCysValPheTrpValGluAspProThrLeuSerSerProGlyHis 360 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I M I I I I I I I I 
Db 1183 AATGTGACTCTGCAATGTGTGTTCTGGGTTGAAGACCCCACATTGAGCAGCCCGGGGCAT 1242 

Qy 3 61 TrpSerSerAlaGlyCysGluThrValArgArgGluThrGlnThrSerCysPheCysAsn 380 

IIIIIIIMMIMIIIIIIIIIIIMIIIIIIMIIMMIIIIMIMIIIIIIIIM 

Db 1243 TGGAGCAGTGCTGGGTGTGAGACCGTCAGGAGAGAAACCCAAACATCCTGCTTCTGCAAC 13 02 

Qy 381 HisLeuThrTyrPheAlaValLeuMetValSerSerValGluValAspAlaValHisLys 400 

MMIIIIIIMIMIIIIIMIIMIIMIIIIIIIIIIIIMIIIIIIIMIIIIIM 

Db 1303 CACTTGACCTACTTTGCAGTGCTGATGGTCTCCTCGGTGGAGGTGGACGCCGTGCACAAG 1362 

Qy 401 HisTyrLeuSerLeuLeuSerTyrValGlyCysValValSerAlaLeuAlaCysLeuVal 420 

IIIIIIIIMIIIIIIIIMIMIIIIIIIIMIMIIMIMIII Mllll IIIIIIM 

Db 1363 CACTACCTGAGCCTCCTCTCCTACGTGGGCTGTGTCGTCTCTGCCCTGGCCTGCCTTGTC 1422 



Qy 


421 


ThrlleAlaAlaTyrLeuCysSerArgValProLeuProCysArgArgLysProArgAsp 

IIIIMIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMMIIIIUIIIII 


440 


Db 


1423 


ACCATTGCCGCCTACCTCTGCTCCAGGGTGCCCCTGCCGTGCAGGAGGAAACCTCGGGAC 


1482 


Qy 


441 


TyrThrlleLysValHisMetAsnLeuLeuLeiiAlaValPheLeuLeuAspThrSerPhe 

IIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIIII 

TACACCATCAAGGTGCACATGAACCTGCTGCTGGCCGTCTTCCTGCTGGACACGAGCTTC 


460 


Db 


1483 


1542 


Qy 


461 


LeuLeuSerGluProValAlaLeuThrGlySerGluAlaGlyCysArgAlaSerAlalle 

IIIIIIIIIIIMMIIIIMIIIIIIMIIIIIMIMIIIIIIIIIIMIIIIIIIII 

CTGCTCAGCGAGCCGGTGGCCCTGACAGGCTCTGAGGCTGGCTGCCGAGCCAGTGCCATC 


480 


Db 


1543 


1602 


Qy 


481 


PheLeuHisPheSerLeuLeuThrCysLeuSerTrpMetGlyLeuGluGlyTyrAsnLeu 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIMIIIIIIIIIIIII 

TTCCTGCACTTCTCCCTGCTCACCTGCCTTTCCTGGATGGGCCTCGAGGGGTACAACCTC 


500 


Db 


1603 


1662 


Qy 


501 


TyrArgLeuValValGluValPheGlyThrTyrValProGlyTyrLeuLeuLysLeuSer 

IIIIIIIIIIIIMMIIIMIIIIIIIMMIIIIMIIIIIIIMIIIIIIIMIMI 


520 


Db 


1663 


TACCGACTCGTGGTGGAGGTCTTTGGCACCTATGTCCCTGGCTACCTACTCAAGCTGAGC 


1722 


Qy 


521 


AlaMetGlyTrpGlyPheProIlePheLeuValThrLeuValAlaLeuValAspValAsp 

IMMIIIIIIIIIMIIIIIIMIIIMMIMIIIMIIIMIIIMUIIIMIIII 


540 


Db 


1723 


GCCATGGGCTGGGGCTTCCCCATCTTTCTGGTGACGCTGGTGGCCCTGGTGGATGTGGAC 


1782 


Qy 


541 


AsnTyrGlyProIlelleLeuAlaValHisArgThrProGluGlyVallleTyrProSer 

IIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 


560 


Db 


1783 


AACTATGGCCCCATCATCTTGGCTGTGCATAGGACTCCAGAGGGCGTCATCTACCCTTCC 


1842 


Qy 


561 


MetCysTrplleArgAspSerLeuValSerTyrlleThrAsnLeuGlyLeuPheSerLeu 

IIIIIIIIIMIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIII 

ATGTGCTGGATCCGGGACTCCCTGGTCAGCTACATCACCAACCTGGGCCTCTTCAGCCTG 


580 


Db 


1843 


1902 


Qy 


581 


ValPheLeuPheAsnMetAlaMetLeuAlaThrMetValValGlnlleLeuArgLeuArg 

IIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII 

GTGTTTCTGTTCAACATGGCCATGCTAGCCACCATGGTGGTGCAGATCCTGCGGCTGCGC 


600 


Db 


1903 


1962 


Qy 


601 


ProHisThrGlnLysTrpSerHisValLeuThrLeuLeuGlyLeuSerLeuValLeuGly 

MMIMMIIIIMIIIIIIIIMMMIMIIIIMIMIMIIMIMMMIIIM 


620 


Db 


1963 


CCCCACACCCAAAAGTGGTCACATGTGCTGACACTGCTGGGCCTCAGCCTGGTCCTTGGC 


2022 


Qy 


621 


LeuProTrpAlaLeuIlePhePheSerPheAlaSerGlyThrPheGlnLeuValValLeu 

III II II Mini llllllllllllllllllllllll MINIMI llllllllllllll 

CTGCCCTGGGCCTTGATCTTCTTCTCCTTTGCTTCTGGCACCTTCCAGCTTGTCGTCCTC 


640 


Db 


2023 


2082 


Qy 


641 


TyrLeuPheSerllelleThrSerPheGlnGlyPheLeuIlePhelleTrpTyrTrpSer 

IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIII 

TACCTTTTCAGCATCATCACCTCCTTCCAAGGCTTCCTCATCTTCATCTGGTACTGGTCC 


660 


Db 


2083 


2142 


Qy 


661 


MetArgLeuGlnAlaArgGlyGlyProSerProLeuLysSerAsnSerAspSerAlaArg 

MMIIIMIIIMIIIIIIIIIIIMMIIIIIIIIIMIIIIIIIIMI llllll 

ATGCGGCTGCAGGCCCGGGGTGGCCCCTCCCCTCTGAAGAGCAACTCAGACTGCGCCAGG 


680 


Db 


2143 


2202 


Qy 


681 


LeuProlleSerSerGlySerThrSerSerSerArglle 693 

IIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIM 




Db 


2203 


CTCCCCATCAGCTCGGGCAGCACCTCGTCCAGCCGCATC 2241 





ID AAZ34292 standard; cDNA; 3819 BP. .^g'^^ U< f\ ^ CurfK/QO^f^^W) 

TYV 07-nRr-i999 (first entrv) I V 



DT 07-DEC-1999 (first entry) 
XX 

DE Human PRO1083 nucleotide sequence. 
XX 

PN W09946281-A2. 
XX 

DR WPI; 1999-551358/46. 
DR P-PSDB; AAY41765. 
XX 

PS Claim 2; Fig 203; 530pp; English. 
XX 

SQ Sequence 3819 BP; 789 A; 1221 C; 996 G; 813 T; 0 U; 0 Other; 
Alignment Scores: 

Pred. No.: 4.46e-314 Length: 3819 

Score: 3604.00 Matches: 693 

Percent Similarity: 100.00% Conservative: 0 

Best Local Similarity: 100.00% Mismatches: 0 

Query Match: 100.00% Indels: 0 

DB: 2 Gaps: 0 

US-09-806-537A-1 (1-693) x AAZ34292 (1-3819) 



Qy 


1 


MetThrProGlnSerLeuLeuGlnThrThrLeuPheLeuLeuSerLeuLeuPheLeuVal 
1 1 M 1 1 1 1 I 1 1 i 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 I 1 1 i 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


20 


Db 


214 


1 1 1 1 M 1 1 i M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M M 1 1 1 1 1 1 
ATGACTCCCCAGTCGCTGCTGCAGACGACACTGTTCCTGCTGAGTCTGCTCTTCCTGGTC 


273 


Qy 


21 


GlnGlyAlaHisGlyArgGlyHisArgGluAspPheArgPheCysSerGlnArgAsnGln 
i 1 1 1 1 1 1 1 I 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


40 


Db 


274 


1 1 M 1 1 1 1 1 M M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 i 1 1 1 I M 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 
CAAGGTGCCCACGGCAGGGGCCACAGGGAAGACTTTCGCTTCTGCAGCCAGGGGAACCAG 


333 


Qy 


41 


ThrHisArgSerSerLeuHisTyrLysProThrProAspLeuArglleSerlleGluAsn 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIII 


60 


Db 


334 


ACACACAGGAGCAGCCTCCACTACAAACCCACACCAGACCTGCGCATCTCCATCGAGAAC 


393 


Qy 


61 


SerGluGluAlaLeuThrValHisAlaProPheProAlaAlaHisProAlaSerArgSer 


80 


Db 


394 


IIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

TCCGAAGAGGCCCTCACAGTCCATGCCCCTTTCCCTGCAGCCCACCCTGCTTCCCGATCC 


453 


Qy 


81 


PheProAspProArgGlyLeuTyrHisPheCysLeuTyrTrpAsnArgHisAlaGlyArg 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIII 


100 


Db 


454 


TTCCCTGACCCCAGGGGCCTCTACCACTTCTGCCTCTACTGGAACCGACATGCTGGGAGA 


513 


Qy 


101 


LeuHisLeuLeuTyrGlyLysArgAspPheLeuLeuSerAspLysAlaSerSerLeuLeu 

IIIIMIIIIIIIIIIMIIIIIIIIIIIMIIIIIIIMIMIIMIIIIIMIIIIII 


120 


Db 


514 


TTACATCTTCTCTATGGCAAGCGTGACTTCTTGCTGAGTGACAAAGCCTCTAGCCTCCTC 


573 


Qy 


121 


CysPheGlnHisGlnGluGluSerLeuAlaGlnGlyProProLeuLeuAlaThrSerVal 

IIMIMIIIIIIIIIIMIIIMIMMIIIIIIIIIIIIIIIMMIIIMIMIIII 


140 


Db 


574 


TGCTTCCAGCACCAGGAGGAGAGCCTGGCTCAGGGCCCCCCGCTGTTAGCCACTTCTGTC 


633 


Qy 


141 


ThrSerTrpTrpSerProGlnAsnlleSerLeuProSerAlaAlaSerPheThrPheSer 


160 


Db 


634 


IIIIIMIIMMIIIMIIIIIIIIIIIIIIIIIIIIMIIIMIIMIIIIIIIMM 

ACCTCCTGGTGGAGCCCTCAGAACATCAGCCTGCCCAGTGCCGCCAGCTTCACCTTCTCC 


693 


Qy 


161 


PheHisSerProProHisThrAlaAlaHisAsnAlaSerValAspMetCysGluLeuLys 

IIMIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIIIIMIIIIIIIMIIIII 


180 


Db 


694 


TTCCACAGTCCTCCCCACACGGCCGCTCACAATGCCTCGGTGGACATGTGCGAGCTCAAA 


753 


Qy 


181 


ArgAspLeuGlnLeuLeuSerGlnPheLeuLysHisProGlnLysAlaSerArgArgPro 

IIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 


200 


Db 


754 


AGGGACCTCCAGCTGCTCAGCCAGTTCCTGAAGCATCCCCAGAAGGCCTCAAGGAGGCCC 


813 



Qy 


201 


SerAlaAlaProAlaSerGlnGlnLeuGlnSerLeuGluSerLysLeuThrSerValArg 

1 1 1 M M 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 


220 


Db 


814 


1 1 1 1 1 1 1 i 1 j 1 1 1 1 1 1 1 1 1 1 1 ' ' 1 1 ' ' 1 ' ' 1 ' ' 1 1 ' ' 1 1 • 1 ' 

TCGGCTGCCCCCGCCAGCCAGCAGTTGCAGAGCCTGGAGTCGAAACTGACCTCTGTGAGA 


873 


Qy 


221 


PheMetGlyAspMetValSerPheGluGluAspArglleAsnAlaThrValTrpLysLeu 

1 M M 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 


240 


Db 


874 


1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 >>' 1 < 1 ■ 

TTCATGGGGGACATGGTGTCCTTCGAGGAGGACCGGATCAACGCCACGGTGTGGAAGCTC 


933 


Qy 


241 


GlnProThrAlaGlyLeuGlnAspLeuHisIleHisSerArgGlnGluGluGluGlnSer 

1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 


260 


Db 


934 


1 1 1 1 1 1 1 f 1 1 1 { 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 < 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 > 1 1 

CAGCCCACAGCCGGCCTCCAGGACCTGCACATCCACTCCCGGCAGGAGGAGGAGCAGAGC 


993 


Qy 


261 


GluIleMetGluTyrSerValLeuLeuProArgThrLeuPheGlnArgThrLysGlyArg 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 


280 


Db 


994 


I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 F 1 1 1 1 f 1 1 1 1 1 1 1 1 p 1 1 1 1 1 t 1 1 1 1 1 1 1 1 r 1 t 1 1 1 I < I I 1 1 < I I 

GAGATCATGGAGTACTCGGTGCTGCTGCCTCGAACACTCTTCCAGAGGACGAAAGGCCGG 


1053 


Qy 


281 


SerGlyGluAlaGluLysArgLeuLeuLeuValAspPheSerSerGlnAlaLeuPheGln 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


300 


Db 


1054 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 > 1 1 1 1 1 [ 1 1 
AGCGGGGAGGCTGAGAAGAGACTCCTCCTGGTGGACTTCAGCAGCCAAGCCCTGTTCCAG 


1113 


Qy 


301 


AspLysAsnSerSerGlnValLeuGlyGluLysValLeuGlylleValValGlnAsnThr 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


320 


Db 


1114 


1 1 J 1 1 [ 1 J I J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 > I 1 '< 1 r r 1 P 1 1 1 J 1 1 1 ' 1 1 1 1 1 1 1 1 1 1 1 1 * J 

GACAAGAATTCCAGCCAAGTCCTGGGTGAGAAGGTCTTGGGGATTGTGGTACAGAACACC 


1173 


Qy 


321 


LysValAlaAsnLeuThrGluProValValLeuThrPheGlnHisGlnLeuGlnProLys 

1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


340 


Db 


1174 


AAAGTAGCCAACCTCACGGAGCCCGTGGTGCTCACTTTCCAGCACCAGCTACAGCCGAAG 


1233 


Qy 


341 


AsnValThrLeuGlnCysValPheTrpValGluAspProThrLeuSerSerProGlyHis 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


360 


Db 


1234 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 J p 1 

AATGTGACTCTGGAATGTGTGTTCTGGGTTGAAGACCGCACATTGAGCAGCCCGGGGCAT 


1293 


Qy 


361 


TrpSerSerAlaGlyCysGluThrValArgArgGluThrGlnThrSerCysPheCysAsn 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


380 


Db 


1294 


1 1 1 p 1 1 1 1 1 1 1 [ 1 1 1 1 I 1 p 1 1 p 1 1 1 1 1 [< p 1 1 1 1 1 1 1 1 1 1 1 1 p p 1 1 1 1 1 1 1 J 1 1 p t 1 1 1 1 
TGGAGCAGTGCTGGGTGTGAGACCGTCAGGAGAGAAACCCAAACATCCTGCTTCTGCAAC 


1353 


Qy 


381 


HisLeuThrTyrPheAlaValLeuMetValSerSerValGluValAspAlaValHisLys 

1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 


400 


Db 


1354 


1 1 1 P 1 1 1 1 1 1 1 1 1 1 P 1 < 1 1 1 1 P S P 1 1 1 t 1 P 1 1 < P 1 1 1 1 1 1 1 I P P P 1 1 1 1 1 I 1 1 1 P P P t P 1 

CACTTGACCTACTTTGCAGTGCTGATGGTCTCCTCGGTGGAGGTGGACGCCGTGCACAAG 


1413 


Qy 


401 


HisTyrLeuSerLeuLeuSerTyrValGlyCysValValSerAlaLeuAlaCysLeuVal 

1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


420 


Db 


1414 


1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 [< 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CACTACCTGAGCCTCCTCTCCTACGTGGGCTGTGTCGTCTCTGCCCTGGCCTGCCTTGTC 


1473 


Qy 


421 


ThrlleAlaAlaTyrLeuCysSerArgValProLeuProCysArgArgLysProArgAsp 

1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 


440 


Db 


1474 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ( 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 < 1 1 

ACCATTGCCGCCTACCTCTGCTCCAGGGTGCCCCTGCCGTGCAGGAGGAAACCTCGGGAC 


1533 


Qy 


441 


TyrThrlleLysValHisMetAsnLeuLeuLeuAlaValPheLeuLeuAspThrSerPhe 

1 1 1 1 M M 1 M M 1 1 1 M 1 1 1 1 1 1 11 1 M 1 1 1 1 1 M M M 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 


460 


Db 


1534 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 ) 1 1 1 1 1 1 ) 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ) 1 1 1 1 ) 1 1 1 1 1 1 1 1 1 1 1 1 

TACACCATCAAGGTGCACATGAACCTGCTGCTGGCCGTCTTCCTGCTGGACACGAGCTTC 


1593 


Qy 


461 


LeuLeuSerGluProValAlaLeuThrGlySerGluAlaGlyCysArgAlaSerAlalle 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


480 


Db 


1594 


1 1 1 [ 1 P 1 1 P P 1 1 1 1 1 [ 1 1 P 1 1 1 P 1 1 1 I J 1 r J 1 1 1 1 1 1 1 1 1 P P 1 1 t 1 1 p 1 P 1 P p 1 P P P 1 p f 

CTGCTCAGCGAGCCGGTGGCCCTGACAGGCTCTGAGGCTGGCTGCCGAGCCAGTGCCATC 


1653 


Qy 


481 


PheLeuHisPheSerLeuLeuThrCysLeuSerTrpMetGlyLeuGluGlyTyrAsnLeu 


500 


Db 


1654 


llllll MINIM IIIIIIIIMIIIIMIII.IIIIII II lllllll Mil III Mill 

TTCCTGCACTTCTCCCTGCTCACCTGCCTTTCCTGGATGGGCCTCGAGGGGTACAACCTC 


1713 


Qy 


501 


TyrArgLeuValValGluValPheGlyThrTyrValProGlyTyrLeuLeuLysLeuSer 

IIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMIIMIMIIIIIIIIIIIIIIIIIIII 


520 


Db 


1714 


TACCGACTCGTGGTGGAGGTCTTTGGCACCTATGTCCCTGGCTACCTACTCAAGCTGAGC 


1773 



Qy 


521 


AlaMetGlyTrpGlyPheProllePheLeuValThrLeuValAlaLeuValAspValAsp 


540 


Db 


1774 


iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiliiiiiiiiiiiiiiiiiiiiiiiiiiiii 

GCCATGGGCTGGGGCTTCCCCATCTTTCTGGTGACGCTGGTGGCCCTGGTGGATGTGGAC 


1833 


Qy 


541 


AsnTyrGlyProIlelleLeuAlaValHisArgThrProGluGlyVallleTyrProSer 


560 


Db 


1834 


MIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIII 

AACTATGGCCCCATCATCTTGGCTGTGCATAGGACTCCAGAGGGCGTCATCTACCCTTCC 


1893 


Qy 


561 


MetCysTrpIleArgAspSerLeuValSerTyrlleThrAsnLeuGlyLeuPheSerLeu 580 
1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ] 1 1 1 1 1 


Db 


1894 


1 ' ' ' ) 1 1 1 1 1 M M M 1 i 1 1 1 M M M M M M 1 M M 1 M 1 M M [ j 1 1 1 M [ 1 

AKjICjLTGGATCCGGGACTCCCTGGTCAGCTACATCACCAACCTGGGCCTCTTCAGCCTG 


1953 


Qy 


581 


ValPheLeuPheAsnMetAlaMetLeuAlaThrMetValValGlnlleLeuArgLeuArq 

IIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIII 


600 


Db 


1954 


GlLriilCI CjTTCAACATGGCCATGCTAGCCACCATGGTGGTGCAGATCCTGCGGCTGCGC 


2 013 


Qy 


601 


ProHisThrGlnLysTrpSerHisValLeuThrLeuLeuGlyLeuSerLeuValLeuGly 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


620 


Db 


2014 


CCCCACACCCAAAAGTGGTCACATGTGCTGACACTGCTGGGCCTCAGCCTGGTCCTTGGC 


2073 


Qy 


621 


LeuProTrpAlaLeuIlePhePheSerPheAlaSerGlyThrPheGlnLeuValValLeu 


640 


Db 


2074 


MIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIlllMi 

CTGCCCTGGGCCTTGATCTTCTTCTCCTTTGCTTCTGGCACCTTCCAGCTTGTCGTCCTC 


2133 


Qy 


641 


TyrLeuPheSerllelleThrSerPheGlnGlyPheLeuIlePhelleTrpTyrTrpSer 


660 


Db 


2134 


IIIIIIMIIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIII 

TACCTTTTCAGCATCATCACCTCCTTCCAAGGCTTCCTCATCTTCATCTGGTACTGGTCC 


2193 


Qy 


661 


MetArgLeuGlnAlaArgGlyGlyProSerProLeuLysSerAsnSerAspSerAlaArq 


680 


Db 


2194 


IIMIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMllMIIIIIIIIIII 

ATGCGGCTGCAGGCCCGGGGTGGCCCCTCCCCTCTGAAGAGCAACTCAGACAGCGCCAGG 


2253 


Qy 


681 


LeuProlleSerSerGlySerThrSerSerSerArglle 693 




Db 


2254 


MIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII 

CTCCCCATCAGCTCGGGCAGCACCTCGTCCAGCCGCATC 2292 





AAZ34292 



ID AAZ34292 Standard; cDNA; 3819 BP. 

AC AAZ34292; ' 
XX 

PN W09946281-A2 . 
XX 

SQ Sequence 3819 BP; 789 A; 1221 C; 996 G; 813 T; , 0 U; 0 Other; 

Query Match 98.2%; Score 3567.6; DB 2; Length 3819; 

Best Local Similarity 99.9%; Pred. No. 0; 



Matches 


3570; Conservative 0; Mismatches 4; Indels 0; Gaps 


0; 


Qy 


59 


AGGTGGTGACTTCCAAGAGTGACTCCGTCGGAGGAAAATGACTCCCCAGTCGCTGCTGCA 


118 


Db 


177 


IIIIIIIIIIIMIMMIIMIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIMI 

AGGTGGTGACTTCCAAGAGTGACTCCGTCGGAGGAAAATGACTCCCCAGTCGCTGCTGCA 


236 


Qy 


119 


GACGACACTGTTCCTGCTGAGTCTGCTCTTCCTGGTCCAAGGTGCCCACGGCAGGGGCCA 


178 


Db 


237 


IIIIIMMMMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIII 

GACGACACTGTTCCTGCTGAGTCTGCTCTTCCTGGTCCAAGGTGCCCACGGCAGGGGCCA 


296 


Qy 


179 


CAGGGAAGACTTTCGCTTCTGCAGCCAGCGGAACCAGACACACAGGAGCAGCCTCCACTA 


238 


Db 


297 


1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CAGGGAAGACTTTCGCTTCTGCAGCCAGCGGAACCAGACACACAGGAGCAGCCTCCACTA 


356 



Qy 

Db 



23 9 CAAACCCACACCAGACCTGCGCATCTCCATCGAGAACTCCGAAGAGGCCCTCACAGTCCA 2 98 

IIIIIIIIIIIIIIIIIMMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

3 57 CAAACCCACACCAGACCTGCGCATCTCCATCGAGAACTCCGAAGAGGCCCTCACAGTCCA 416 



Qy 


299 


TGCCCCTTTCCCTGCAGCCCACCCTGCTTCCCGATCCTTCCCTGACCCCAGGGGCCTCTA 


358 


Db 


417 


IIIIIIIIIIIIIIIIIIIM'IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII 

TGCCCCTTTCCCTGCAGCCCACCCTGCTTCCCGATCCTTCCCTGACCCCAGGGGCCTCTA 


476 


Qy 


359 


CCACTTCTGCCTCTACTGGAACCGACATGCTGGGAGATTACATCTTCTCTATGGCAAGCG 


418 


Db 


477 


IIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

CCACTTCTGCCTCTACTGGAACCGACATGCTGGGAGATTACATCTTCTCTATGGCAAGCG 


536 


Qy 


419 


TGACTTCTTGCTGAGTGACAAAGCCTCTAGCCTCCTCTGCTTCCAGCACCAGGAGGAGAG 


478 


Db 


537 


IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIIIMIIIIIIIIIIIIII 

TGACTTCTTGCTGAGTGACAAAGCCTCTAGCCTCCTCTGCTTCCAGCACCAGGAGGAGAG 


596 


Qy 


479 


CCTGGCTCAGGGCCCCCCGCTGTTAGCCACTTCTGTCACCTCCTGGTGGAGCCCTCAGAA 


538 


Db 


597 


IIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII 

CCTGGCTCAGGGCCCCCCGCTGTTAGCCACTTCTGTCACCTCCTGGTGGAGCCCTCAGAA 


656 


Qy 


539 


CATCAGCCTGCCCAGTGCCGCCAGCTTCACCTTCTCCTTCCACAGTCCTCCCCACACGGC 


598 


Db 


657 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CATCAGCCTGCCCAGTGCCGCCAGCTTCACCTTCTCCTTCCACAGTCCTCCCCACACGGC 


716 


Qy 


599 


CGCTCACAATGCCTCGGTGGACATGTGCGAGCTCAAAAGGGACCTCCAGCTGCTCAGCCA 


658 


Db 


717 


IIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIII 

CGCTCACAATGCCTCGGTGGACATGTGCGAGCTCAAAAGGGACCTCCAGCTGCTCAGCCA 


776 


Qy 


659 


GTTCCTGAAGCATCCCCAGAAGGCCTCAAGGAGGCCCTCGGCTGCCCCCGCCAGCCAGCA 


718 


Db 


777 


IIMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIM 

GTTCCTGAAGCATCCCCAGAAGGCCTCAAGGAGGCCCTCGGCTGCCCCCGCCAGCCAGCA 


836 


Qy 


719 


GTTGCAGAGCCTGGAGTCGAAACTGACCTCTGTGAGATTCATGGGGGACATGGTGTCCTT 


778 


Db 


837 


MMIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIMM 

GTTGCAGAGCCTGGAGTCGAAACTGACCTCTGTGAGATTCATGGGGGACATGGTGTCCTT 


896 


Qy 


779 


CGAGGAGGACCGGATCAACGCCACGGTGTGGAAGCTCCAGCCCACAGCCGGCCTCCAGGA 


838 


Db 


897 


IIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIII 

CGAGGAGGACCGGATCAACGCCACGGTGTGGAAGCTCCAGCCCACAGCCGGCCTCCAGGA 


956 


Qy 


839 


CCTGCACATCCACTCCCGGCAGGAGGAGGAGCAGAGCGAGATCATGGAGTACTCGGTGCT 


898 


Db 


957 


IIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

CCTGCACATCCACTCCCGGCAGGAGGAGGAGCAGAGCGAGATCATGGAGTACTCGGTGCT 


1016 


Qy 


899 


GCTGCCTCGAACACTCTTCCAGAGGACGAAAGGCCGGAGCGGGGAGGCTGAGAAGAGACT 


958 


Db 


1017 


MMMMIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

GCTGCCTCGAACACTCTTCCAGAGGACGAAAGGCCGGAGCGGGGAGGCTGAGAAGAGACT 


1076 


Qy 


959 


CCTCCTGGTGGACTTCAGCAGCCAAGCCCTGTTCCAGGACAAGAATTCCAGCCAAGTCCT 


1018 


Db 


1077 


IIMMIMIIIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIII 

CCTCCTGGTGGACTTCAGCAGCCAAGCCCTGTTCCAGGACAAGAATTCCAGCCAAGTCCT 


1136 


Qy 


1019 


GGGTGAGAAGGTCTTGGGGATTGTGGTACAGAACACCAAAGTAGCCAACCTCACGGAGCC 


1078 


Db 


1137 


MIMMIIMIIMIIIIIIIMIIIIIIIIIMIIIIIMIIIIIIIIIIIIIMIM 

GGGTGAGAAGGTCTtGGGGATTGTGGTACAGAACACCAAAGTAGCCAACCTCACGGAGCC 


1196 


Qy 


1079 


CGTGGTGCTCACCTTCCAGCACCAGCTACAGCCGAAGAATGTGACTCTGCAATGTGTGTT 


1138 


Db 


1197 


IMMIIMIII illllllllllllllllllllllllllllMIIIIIIIIIIIIIIII 

CGTGGTGCTCACTTTCCAGCACCAGCTACAGCCGAAGAATGTGACTCTGCAATGTGTGTT 


1256 


Qy 


1139 


CTGGGTTGAAGACCCCACATTGAGCAGCCCGGGGCATTGGAGCAGTGCTGGGTGTGAGAC 


1198 


Db 


1257 


1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CTGGGTTGAAGACCCCACATTGAGCAGCCCGGGGCATTGGAGCAGTGCTGGGTGTGAGAC 


1316 




Qy 


1199 


CGTCAGGAGAGAAACCCAAACATCCTGCTTCTGCAACCACTTGACCTACTTTGCAGTGCT 


1258 


Db 


1317 


1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 
CGTCAGGAGAGAAACCCAAACATCCTGCTTCTGCAACCACTTGACCTACTTTGCAGTGCT 


1376 


Qy 


1259 


GATGGTCTCCTCGGTGGAGGTGGACGCCGTGCACAAGCACTACCTGAGCCTCCTCTCCTA 


1318 


Db 


1377 


IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIMIIIMIII 

GATGGTCTCCTCGGTGGAGGTGGACGCCGTGCACAAGCACTACCTGAGCCTCCTCTCCTA 


1436 


Qy 


1319 


CGTGGGCTGTGTCGTCTCTGCCCTGGCCTGCCTTGTCACCATTGCCGCCTACCTCTGCTC 


1378 


Db 


1437 


IIIIIIIMIIII-IIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIII 

CGTGGGCTGTGTCGTCTCTGCCCTGGCCTGCCTTGTCACCATTGCCGCCTACCTCTGCTC 


1496 


Qy 


1379 


CAGGGTGCCCCTGCCGTGCAGGAGGAAACCTCGGGACTACACCATCAAGGTGCACATGAA 


1438 


Db 


1497 


IIMIIMIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIII 

CAGGGTGCCCCTGCCGTGCAGGAGGAAACCTCGGGACTACACCATCAAGGTGCACATGAA 


1556 


Qy 


1439 


CCTGCTGCTGGCCGTCTTCCTGCTGGACACGAGCTTCCTGCTCAGCGAGCCGGTGGCCCT 


1498 


Db 


1557 


IIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIII 

CCTGCTGCTGGCCGTCTTCCTGCTGGACACGAGCTTCCTGCTCAGCGAGCCGGTGGCCCT 


1616 


Qy 


1499 


GACAGGCTCTGAGGCTGGCTGCCGAGCCAGTGCCATCTTCCTGCACTTCTCCCTGCTCAC 


1558 


Db 


1617 


IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIMIMMIIIIIMIIIIII 

GACAGGCTCTGAGGCTGGCTGCCGAGCCAGTGCCATCTTCCTGCACTTCTCCCTGCTCAC 


1676 


Qy 


1559 


CTGCCTTTCCTGGATGGGCCTCGAGGGGTACAACCTCTACCGACTCGTGGTGGAGGTCTT 


1618 


Db 


1677 


IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

CTGCCTTTCCTGGATGGGCCTCGAGGGGTACAACCTCTACCGACTCGTGGTGGAGGTCTT 


1736 


Qy 


1619 


TGGCACCTATGTCCCTGGCTACCTACTCAAGCTGAGCGCCATGGGCTGGGGCTTCCCCAT 


1678 


Db 


1737 


MMMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIII 

TGGCACCTATGTCCCTGGCTACCTACTCAAGCTGAGCGCCATGGGCTGGGGCTTCCCCAT 


1796 


Qy 


1679 


CTTTCTGGTGACGCTGGTGGCCCTGGTGGATGTGGACAACTATGGCCCCATCATCTTGGC 


1738 


Db 


1797 


IIIMIIMIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIM 

CTTTCTGGTGACGCTGGTGGCCCTGGTGGATGTGGACAACTATGGCCCCATCATCTTGGC 


1856 


Qy 


1739 


TGTGCATAGGACTCCAGAGGGCGTCATCTACCCTTCCATGTGCTGGATCCGGGACTCCCT 


1798 


Db 


1857 


IMMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIII 

TGTGCATAGGACTCCAGAGGGCGTCATCTACCCTTCCATGTGCTGGATCCGGGACTCCCT 


1916 


Qy 


1799 


GGTCAGCTACATCACCAACCTGGGCCTCTTCAGCCTGGTGTTTCTGTTCAACATGGCCAT 


1858 


Db 


1917 


M 1 M 1 M 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GGTCAGCTACATCACCAACCTGGGCCTCTTCAGCCTGGTGTTTCTGTTCAACATGGCCAT 


1976 


Qy 


1859 


GCTAGCCACCATGGTGGTGCAGATCCTGCGGCTGCGCCCCCACACCCAAAAGTGGTCACA 


1918 


Db 


1977 


IMNIMIIIIIIIIIIIIIIIIIIIIMIMIIIIMIIIIMIIIIIIIIIIIIIII 

GCTAGCCACCATGGTGGTGCAGATCCTGCGGCTGCGCCCCCACACCCAAAAGTGGTCACA 


2036 


Qy 


1919 


TGTGCTGACACTGCTGGGCCTCAGCCTGGTCCTTGGCCTGCCCTGGGCCTTGATCTTCTT 


1978 


Db 


2037 


1 1 1 1 N 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TGTGCTGACACTGCTGGGCCTCAGCCTGGTCCTTGGCCTGCCCTGGGCCTTGATCTTCTT 


2096 


Qy 


1979 


CTCCTTTGCTTCTGGCACCTTCCAGCTTGTCGTCCTCTACCTTTTCAGCATCATCACCTC 


2038 


Db 


2097 


IIIIIIIMIMMIIIIIMIMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIII 

CTCCTTTGCTTCTGGCACCTTCCAGCTTGTCGTCCTCTACCTTTTCAGCATCATCACCTC 


2156 


Qy 


2039 


CTTCCAAGGCTTCCTCATCTTCATCTGGTACTGGTCCATGCGGCTGCAGGCCCGGGGTGG 


2098 


Db 


2157 


IIIMIIIIIIIIIIIIIIMIIIIIIIIIMIMIIIIIMIIIMIIIIIIIIIIIM 

CTTCCAAGGCTTCCTCATCTTCATCTGGTACTGGTCCATGCGGCTGCAGGCCCGGGGTGG 


2216 


Qy 


2099 


CCCCTCCCCTCTGAAGAGCAACTCAGACAGCGCCAGGCTCCCCATCAGCTCGGGCAGCAC 


2158 


Db 


2217 


IMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

CCCCTCCCCTCTGAAGAGCAACTCAGACAGCGCCAGGCTCCCCATCAGCTCGGGCAGCAC 


2276 




Qy 


2159 


CTCGTCCAGCCGCATCTAGGCCTCCAGCCCACCTGCCCATGTGATGAAGCAGAGATGCGG 


2218 


Db 


2277 


IIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIII 

CTCGTCCAGCCGCATCTAGGCCTCCAGCCCACCTGCCCATGTGATGAAGCAGAGATGCGG 


2336 


Qy 


2219 


CCTCGTCGCACACTGCCTGTGGCCCCCGAGCCCGGCCCAGCCCCAGGCCAGTCAGCCGCA 


2278 


Db 


2337 


llllllllllllllllllllllllllllllll lllllllllllllllllllllllllll 
CCTCGTCGCACACTGCCTGTGGCCCCCGAGCCAGGCCCAGCCCCAGGCCAGTCAGCCGCA 


2396 


Qy 


2279 


GACTTTGGAAAGCCCAACGACCATGGAGAGATGGGCCGTTGCCATGGTGGACGGACTCCC 


2338 


Db 


2397 


IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII 

GACTTTGGAAAGCCCAACGACCATGGAGAGATGGGCCGTTGCCATGGTGGACGGACTCCC 


2456 


Qy 


2339 


GGGCTGGGCTTTTGAATTGGCCTTGGGGACTACTCGGCTCTCACTCAGCTCCCACGGGAC 

IIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

GGGCTGGGCTTTTGAATTGGCCTTGGGGACTACTCGGCTCTCACTCAGCTCCCACGGGAC 


2398 


Db 


2457 


2516 


Qy 


2399 


TCAGAAGTGCGCCGCCATGCTGCCTAGGGTACTGTCCCCACATCTGTCCCAACCCAGCTG 


2458 


Db 


2517 


IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMIII 

TCAGAAGTGCGCCGCCATGCTGCCTAGGGTACTGTCCCCACATCTGTCCCAACCCAGCTG 


2576 


Qy 


2459 


GAGGCCTGGTCTCTCCTTATAACCCCTGGGCCCAGCCCTCATTGCTGGGGGCCAGGCCTT 

iiiiiMiiiiiiiiiiii iiiiMiiiiiiiiiiiiiiiriiiiiiiiiiiiiiiiii 

GAGGCCTGGTCTCTCCTTACAACCCCTGGGCCCAGCCCTCATTGCTGGGGGCCAGGCCTT 


2518 


Db 


2577 


2636 


Qy 


2519 


GGATCTTGAGGGTCTGGCACATCCTTAATCCTGTGCCCCTGCCTGGGACAGAAATGTGGC 


2578 


Db 


2637 


IMMIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMMI 

GGATCTTGAGGGTCTGGCACATCCTTAATCCTGTGCCCCTGCCTGGGACAGAAATGTGGC 


2696 


Qy 


2579 


TCCAGTTGCTCTGTCTCTCGTGGTCACCCTGAGGGCACTCTGCATCCTCTGTCATTTTAA 


2638 


Db 


2697 


IMIIMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMMIIMI 

TCCAGTTGCTCTGTCTCTCGTGGTCACCCTGAGGGCACTCTGCATCCTCTGTCATTTTAA 


2756 


Qy 


2639 


CCTCAGGTGGCACCCAGGGCGAATGGGGCCCAGGGCAGACCTTCAGGGCCAGAGCCCTGG 


2698 


Db 


2757 


IIIIIIMMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMIIIIIIIII 

CCTCAGGTGGCACCCAGGGCGAATGGGGCCCAGGGCAGACCTTCAGGGCCAGAGCCCTGG 


2816 


Qy 


2699 


CGGAGGAGAGGCCCTTTGCCAGGAGCACAGCAGCAGCTCGCCTACCTCTGAGCCCAGGCC 


2758 


Db 


2817 


MIIIIMIMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIIIIIII 

CGGAGGAGAGGCCCTTTGCCAGGAGCACAGCAGCAGCTCGCCTACCTCTGAGCCCAGGCC 


2876 


Qy 


2759 


CCCTCCCTCCCTCAGCCCCCCAGTCCTCCCTCCATCTTCCCTGGGGTTCTCCTCCTCTCC 


2818 


Db 


2877 


MIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIII 

CCCTCCCTCCCTCAGCCCCCCAGTCCTCCCTCCATCTTCCCTGGGGTTCTCCTCCTCTCC 


2936 


Qy 


2819 


CAGGGCCTCCTTGCTCCTTCGTTCACAGCTGGGGGTCCCCGATTCCAATGCTGTTTTTTG 


2878 


Db 


2937 


IMMIIIIIIIIMIIIIIIIIIIIMIIIIMIIIIIIMIIIIIIIIIIIIIIIIM 

CAGGGCCTCCTTGCTCCTTCGTTCACAGCTGGGGGTCCCCGATTCCAATGCTGTTTTTTG 


2996 


Qy 


2879 


GGGAGTGGTTTCCAGGAGCTGCCTGGTGTCTGCTGTAAATGTTTGTCTACTGCACAAGCC 


2938 


Db 


2997 


IMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIII 

GGGAGTGGTTTCCAGGAGCTGCCTGGTGTCTGCTGTAAATGTTTGTCTACTGCACAAGCC 


3056 


Qy 


2939 


TCGGCCTGCCCCTGAGCCAGGCTCGGTACCGATGCGTGGGCTGGGCTAGGTCCCTCTGTC 


2998 


Db 


3057 


1 1 1 1 i 1 1 1 M M 1 1 1 M 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 

TCGGCCTGCCCCTGAGCCAGGCTCGGTACCGATGCGTGGGCTGGGCTAGGTCCCTCTGTC 


3116 


Qy 


2999 


CATCTGGGCCTTTGTATGAGCTGCATTGCCCTTGCTCACCCTGACCAAGCACACGCCTCA 


3058 


Db 


3117 


IIIIIMMMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIII 

CATCTGGGCCTTTGTATGAGCTGCATTGCCCTTGCTCACCCTGACCAAGCACACGCCTCA 


3176 


Qy 


3059 


GAGGGGCCCTCAGCCTCTCCTGAAGCCCTCTTGTGGCAAGAACTGTGGACCATGCCAGTC 


3118 


Db 


3177 


IIMIIIIIMIIIMIIIMMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIll 

GAGGGGCCCTCAGCCTCTCCTGAAGCCCTCTTGTGGCAAGAACTGTGGACCATGCCAGTC 


3236 



Qy 

Db 



3119 CCGTCTGGTTTCCATCCCACCACTCCAAGGACTGAGACTGACCTCCTCTGGTGACACTGG 3178 

IIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIM 

3237 CCGTCTGGTTTCCATCCCACCACTCCAAGGACTGAGACTGACCTCCTCTGGTGACACTGG 3296 



Qy 3179 CCTAGAGCCTGACACTCTCCTAAGAGGTTCTCTCCAAGCCCCCAAATAGCTCCAGGCGCC 3 23 8 

IMIIIIIMIIMIIIIIIIIIMMIIIIIIIIIIIIIIMIIIMIMIIIIIIMI 

Db 3297 CCTAGAGCCTGACACTCTCCTAAGAGGTTCTCTCCAAGCCCCCAAATAGCTCCAGGCGCC 3356 

Qy 323 9 CTCGGCCGCCCATCATGGTTAATTCTGTCCAACAAACACACACGGGTAGATTGCTGGCCT 3298 

IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 3357 CTCGGCCGCCCATCATGGTTAATTCTGTCCAACAAACACACACGGGTAGATTGCTGGCCT 3416 

Qy 3299 GTTGTAGGTGGTAGGGACACAGATGACCGACCTGGTCACTCCTCCTGCCAACATTCAGTC 3358 

IIIIIIIIJIMIIMIIIIIIIIIIilllllllllllllllllllllllllllllllll 

Db 3417 GTTGTAGGTGGTAGGGACACAGATGACCGACCTGGTCACTCCTCCTGCCAACATTCAGTC 34 76 

Qy 3359 TGGTATGTGAGGCGTGCGTGAAGCAAGAACTCCTGGAGCTACAGGGACAGGGAGCCATCA 3418 

MIIIIIIIMMIIIIIMIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIII 

Db 34 77 TGGTATGTGAGGCGTGCGTGAAGCAAGAACTCCTGGAGCTACAGGGACAGGGAGCCATCA 3 536 

Qy 3419 TTCCTGCCTGGGAATCCTGGAAGACTTCCTGCAGGAGTCAGCGTTCAATCTTGACCTTGA 34 78 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3 53 7 TTCCTGCCTGGGAATCCTGGAAGACTTCCTGCAGGAGTCAGCGTTCAATCTTGACGTTGA 3596 

Qy 3479 AGATGGGAAGGATGTTCTTTTTACGTACCAATTCTTTTGTCTTTTGATATTAAAAAGAAG 3538 

IIIMIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 3597 AGATGGGAAGGATGTTCTTTTTACGTACCAATTCTTTTGTCTTTTGATATTAAAAAGAAG 365G 

Qy 353 9 TACATGTTCATTGTAGAGAATTTGGAAACTGTAGAAGAGAATCAAGAAGAAAAATAAAAA 3598 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 3 657 TACATGTTCATTGTAGAGAATTTGGAAACTGTAGAAGAGAATCAAGAAGAAAAATA/^W^ 3716 

Qy 3 599 TCAGCTGTTGTAATCACCTAGCAAAAAAAAAAAA 3 632 

lllllllllllllll llllllllllllllllll 
Db 3717 TCAGCTGTTGTAATCGCCTAGCAAAAAAAAAAAA 3750 
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Qy 601 CTCACAATGCCTCGGTGGACATGTGCGAGCTCAAAAGGGACCTCCAGCTGCTCAGCCAGT 660 

I III II I llllllllllllllllllllllllllllllllllll llllllllllllll 

Db 36 CGCACAATNCCTCGGTGGACATGTGCGAGCTCAAAAGGGACCTCCNGCTGCTCAGCCAGT 95 

Qy 6 61 TCCTGAAGCATCCCCAGAAGGCCTCAAGGAGGCCCTCGGCTGCCCCCGCCAGCCAGCAGT 720 

II llllllllllllll III! II I III! I II llllllllllllll II llllllllllllll 

Db 96 TCCTGAAGCATCCCCAGAAGGCCTCAAGGAGGCCGTCGGCTGCCCCCGCCAGCCAGCAGT 155 

Qy 721 TGCAGAGCCTGGAGTCGAAACTGACCTCTGTGAGATTCATGGGGGACATGGTGTCCTTCG 780 

{IMIMIMIMIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIMIIIIII 

Db 156 TGCAGAGCCTGGAGTCGAAACTGACCTCTGTGAGATTCATGGGGGACATGGTGTCCTTCG 215 

Qy 781 AGGAGGACCGGATCAACGCCACGGTGTGGAAGCTCCAGCCCACAGCCGGCCTCCAGGACC 84 0 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 216 AGGAGGACCGGATCAACGCCACGGTGTGGAAGCTCCAGCCCACAGCCGGCCTCCAGGACC 275 

Qy 841 TGCACATCCACTCCCGGCAGGAGGAGGAGCAGAGCGAGATCATGGAGTACTCGGTGCTGC 900 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIII 

Db 2 76 TGCACATCCACTCCCGGCAGGAGGAGGAGCAGAGCGAGATCATGGAGTACTCGGTGCTGC 3 3 5 

Qy 901 TGCCTCGAACACTCTTCCAGAGGACGAAAGGCCGGAGCGGGGAGGCTGAGAAGAGACTCC 960 

I Mill llllllllllllll II lllll II llllllllllllll llllllllllllll III 

Db 336 TGCCTCGAACACTCTTCCAGAGGACGAAAGGCCGGAGCGGGGAGGCTGAGAAGAGACTCC 3 95 

Qy 961 TCCTGGTGGACTTCAGCAGCCAAGCCCTGTTCCAGGACAAGAATTCCAGCCAAGTCCTGG 1020 

INI II II lllll llllllllllllll II II I lllll III III II II III 11 1 1 II III I 

Db 396 TCCTGGTGGACTTCAGCAGCCAAGCCCTGTTCCAGGACAAGAATTCCAGCCAAGTCCTGG 4 55 

Qy 1021 GTGAGAAGGTCTTGGGGATTGTGGTACAGAACACCAAAGTAGCCAACCTCACGGAGCCCG 1080 

IIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMMIIIIMIIIIIIIIIII 

Db 456 GTGAGAAGGTCTTGGGGATTGTGGTACAGAACACCAAAGTAGCCAACCTCACGGAGCCCG 515 

Qy 1081 TGGTGCTCACCTTCCAGCACCAGCTACAGCCGAAGAATGTGACTCTGCAATGTGTGTTCT 114 0 

1 1 1 1 1 1 1 M n I M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M I M 1 1 1 1 1 1 M M I M 1 1 1 

Db 516 TGGTGCTCACCTTCCAGCACCAGCTACAGCCGAAGAATGTGACTCTGCAATGTGTGTTCT 5 75 

Qy 1141 GGGTTGAAGACCCCACATTGAGCAGCCCGGGGCATTGGAGCAGTGCTGGGTGTGAGACCG 1200 

IIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 576 GGGTTGAAGACCCCACATTGAGCAGCCCGGGGCATTGGAGCAGTGCTGGGTGTGAGACCG 635 

Qy 12 01 TCAGGAGAGAAACCCAAACATCCTGCTTCTGCAACCACTTGACCTACTTTGCAGTGCTGA 12 60 

M 1 1 1 1 1 1 1 II 1 1 1 1 1 II II II 1 1 II II II 1 1 II II II II II II II 1 1 1 II 1 1 II II II I 

Db 636 TCAGGAGAGAAACCCAAACATCCTGCTTCTGCAACCACTTGACCTACTTTGCAGTGCTGA 695 ' 

Qy 1261 TGGTCTCCTCGGTGGAGGTGGACGCCGTGCACAAGCACTACCTGAGCCTCCTCTCCTACG 1320 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 696 TGGTCTCCTCGGTGGAGGTGGACGCCGTGCACAAGCACTACCTGAGCCTCCTCTCCTACG 755 

Qy 1321 TGGGCTGTGTCGTCTCTGCCCTGGCCTGCCTTGTCACCATTGCCGCCTACCTCTGCTCCA 1380 

IIIIIMIIMMIIMIIIIIIMIMIIIIMIMIIIIIIIIIIIMMIMIIIII 

Db 756 TGGGCTGTGTCGTCTCTGCCCTGGCCTGCCTTGTCACCATTGCCGCCTACCTCTGCTCCA 815 

Qy 1381 GGGTGCCCCTGCCGTGCAGGAGGAAACCTCGGGACTACACCATCAAGGTGCACATGAACC 1440 

IIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 816 GGGTGCCCCTGCCGTGCAGGAGGAAACCTCGGGACTACACCATCAAGGTGCACATGAACC 875 



Qy 1441 TGCTGCTGGCCGTCTTCCTGCTGGACACGAGCTTCCTGCTCAGCGAGCCGGTGGCCCTGA 1500 

IIIIMIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 876 TGCTGCTGGCCGTGTTCCTGCTGGACACGAGCTTCCTGCTCAGCGAGCCGGTGGCCCTGA 93 5 

Qy 1501 CAGGCTCTGAGGCTGGCTGCCGAGCCAGTGCCATCTTCCTGCACTTCTCCCTGCTCACCT 1560 

IIIIIIIIIIIIIIIIMIIIIIIIIIIIMIMIIIIIIIIIIMIIIIIIIIIIIMI 

Db 93 6 CAGGCTCTGAGGCTGGCTGCCGAGCCAGTGCCATCTTCCTGCACTTCTCCCTGCTCACCT 995 

Qy 1561 GCCTTTCCTGGATGGGCCTCGAGGGGTACAACCTCTACCGACTCGTGGTGGAGGTCTTTG 1620 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 996 GCCTTTCCTGGATGGGCCTCGAGGGGTACAACCTCTACCGACTCGTGGTGGAGGTCTTTG 1055 

Qy 1621 GCACCTATGTCCCTGGCTACCTACTCAAGCTGAGCGCCATGGGCTGGGGCTTCCCCATCT 1680 

IIIIIMIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIM 

Db 1056 GCACCTATGTCCCTGGCTACCTACTCAAGCTGAGCGCCATGGGCTGGGGCTTCCCCATCT 1115 

Qy 1681 TTCTGGTGACGCTGGTGGCCCTGGTGGATGTGGACAACTATGGCCCCATCATCTTGGCTG 1740 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1116 TTCTGGTGACGCTGGTGGCCCTGGTGGATGTGGACAACTATGGCCCCATCATCTTGGCTG 1175 

Qy 1741 TGCATAGGACTCCAGAGGGCGTCATCTACCCTTCCATGTGCTGGATCCGGGACTCCCTGG 1800 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 1176 TGCATAGGACTCCAGAGGGCGTCATCTACCCTTCCATGTGCTGGATCCGGGACTCCCTGG 1235 

Qy 1801 TCAGCTACATCACCAACCTGGGCCTCTTCAGCCTGGTGTTTCTGTTCAACAT-GGCCATG 1859 

1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M I 1 1 1 1 1 1 1 

Db 1236 TCAGCTACATCACCAACCTGGGCCTCTTCAGCCTGGTGTTTCTGTTCAACATGGGCCATG 1295 

Qy 1860 CTAGCCACCATGGTGGTGCAGATCCTGCGGCTGCGCCCCCACACCCAAAAGT-GGTCACA 1918 

IIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIII lllllll 

Db 1296 CTAGCCACCATGGTGGTGCAGATCCTGCGGCTGCGCCCCCACACCCAAAAGTGGGTCACA 1355 

Qy 1919 TGTGCTGACACTGCTGGGCCTCAGCCTGGTCCTTGGCCTGCCCT-GGGCCTTGATCTTCT 1977 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII IIIIMIIIIIIIII 

Db 1356 TGTGCTGACACTGCTGGGCCTCAGCCTGGTCCTTGGCCTGCCCTGGGGCCTTGATCTTCT 1415 

Qy 1978 TCTCCTTTGCTTCTGGCACCTTCCAGCTTGTCGTCCTCTACCTTTTCAGCATCATCACCT 2 037 

IIIIIMIIIIIIMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI 

Db 1416 TCTCCTTTGCTTCTGGCACCTTCCAGCTTGTCGTCCTCTACCTTTTCAGCATCATCACCT 14 75 

Qy 2 03 8 CCTTCCAAGGCTTCCTCATCTTCATCTGGTACTGGTCCATGCGGCTGCAGGCCCGGGGTG 2097 

MIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIII 

Db 1476 CCTTCCAAGGCTTCCTCATCTTCATCTGGTACTGGTCCATGCGGCTGCAGGCCCGGGGTG 1535 

Qy 2 098 GCCCCTCCCCTCTGAAGAGCAACTCAGACAGCGCCAGGCTCCCCATCAGCTCGGGCAGCA 2157 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIM 

Db 153 6 GCCCCTCCCCTCTGAAGAGCAACTCAGACAGCGCCAGGCTCCCCATCAGCTCGGGCAGCA 1595 

Qy 2158 CCTCGTCCAGCCGCATCTAGGCCTCCAGCCCACCTGCCCATGTGATGAAGCAGAGATGCG 2217 

IIIIMIMIIIIIIIMIIIIIIIIIIIMIIMMMIIIIMMIIIMIIIIIIM 

Db 1596 CCTCGTCCAGCCGCATCTAGGCCTCCAGCCCACCTGCCCATGTGATGAAGCAGAGATGCG 1655 

Qy 2218 GCCTCGTCGCACACTGCCTGTGGCCCCCGAGCCCGGCCCAGCCCCAGGCCAGTCAGCCGC 2277 

IIIIIIMIIIIIIIIIIIIIIIIIIIIIIIII llllllllllllllllllllllllll 

Db 16 56 GCCTCGTCGCACACTGCCTGTGGCCCCCGAGCCAGGCCCAGCCCCAGGCCAGTCAGCCGC 1715 

Qy 2278 AGACTTTGGAAAGCCCAACGACCATGGAGAGATGGGCCGTTGCCATGGTGGAC-GGACTC 2336 

IMIIIIIUIIIIIIMMIIMIIIIIIIIMIMIIIIIIIIIIMIIM llllll 

Db 1716 AGACTTTGGAAAGCCCAACGACCATGGAGAGATGGGCCGTTGCCATGGTGGACGGGACTC 1775 

Qy 233 7 CCGGGCTGGGCTTTTGAATTGGCCTTGGGGACTACTCGGCTCTCACTCAGCTCCCACGGG 2396 

IMIIIMIIIIIIIIIMIIIMIIIIIMIIIIIIIIIIIIIIMIIIIIMIIIIII 

Db 1776 CCGGGCTGGGCTTTTGAATTGGCCTTGGGGACTACTCGGCTCTCACTCAGCTCCCACGGG 183 5 



Qy 2397 ACTCAGAAGTGCGCCGCCATGCTGCCTAGGGTACTGTCCCCACATCTGTCCCAACCCAGC 2456 

IIIMIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIMMIMIIIIIIIMIIIII 

Db 1836 ACTCAGAAGTGCGCCGCCATGCTGCCTAGGGTACTGTCCCCACATCTGTCCCAACCCAGC 1895 

Qy 24 57 TGGAGGCCTGGTCTCTCCTTATAACCCCT-GGGCCCAGCCCTCATTGCTGGGGGCCAGGC 2515 

■ lllllllllllllllllllll lllllll IIIIIIIMMIIIMIIIIIIIIIIMM 

Db 1896 TGGAGGCCTGGTCTCTCCTTACAACCCCTGGGGCCCAGCCCTCATTGCTGGGGGCCAGGC 1955 

Qy 2516 CTTGGATCTTGAGGGTCTGGCACATCCTTAATCCTGTGCCCCTGCCTGGGACAGAAATGT 25 75 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1956 CTTGGATCTTGAGGGTCTGGCACATCCTTAATCCTGTGCCCCTGCCTGGGACAGAAATGT 2015 

Qy 2576 GGCTCCAGTTGCTCTGTCTCTCGTGGTCACCCTGAGGGCACTCTGCATCCTCTGTCATTT 2635 

IIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db .2016 GGCTCCAGTTGCTCTGTCTCTCGTGGTCACCCTGAGGGCACTCTGCATCCTCTGTCATTT 2075 

Qy 263 6 TAACCTCAGGTGGCACCCAGGGCGAATGGGGCCCAGGGCAGACCTTCAGGGCCAGAGCCC 2695 

IIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMIMMIIIIIIIIII 

Db 2 076 TAACCTCAGGTGGCACCCAGGGCGAATGGGGCCCAGGGCAGACCTTCAGGGCCAGAGCCC 2135 

Qy 2 696 TGGCGGAGGAGAGGCCCTTTGCCAGGAGCACAGCAGCAGCTCGCCTACCTCTGAGCCCAG 2755 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI 

Db 2136 TGGCGGAGGAGAGGCCCTTTGCCAGGAGCACAGCAGCAGCTCGCCTACCTCTGAGCCCAG 2195 

Qy 2756 GCCCCCTCCCTCCCTCAGCCCCCCAGTCCTCCCTCCATCTTCCCTGGGGTTCTCCTCCTC 2815 

MIIIMIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMIIIIII 

Db 2196 GCCCCCTCCCTCCCTCAGCCCCCCAGTCCTCCCTCCATCTTCCCTGGGGTTCTCCTCCTC 2255 

Qy 2 816 TCCCAGGGCCTCCTTGCTCCTTCGTTCACAGCT-GGGGGTCCCCGATTCCAATGCTGTTT 2874 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII llllllllllllllllllllllllll 
Db 2256 TCCCAGGGCCTCCTTGCTCCTTCGTTCACAGCTGGGGGGTCCCCGATTCCAATGCTGTTT 2315 

Qy 2875 TTTGGGGAGTGGTTTCCAGGAGCTGCCTGGTGTCTGCTGTAAATGTTTGTCTACTGCACA 2934 

MIIIIIIMMIIMIMIIIIIIIIMIIIIIIIIIIIMIIIIIIIIMIIIMIM 

Db 2316 TTTGGGGAGTGGTTTCCAGGAGCTGCCTGGTGTCTGCTGTAAATGTTTGTCTACTGCACA 23 75 

Qy 2 935 AGCCTCGGCCTGCCCCT-GAGCCAGGCTCGGTACCGATGCGTGGGCTGGGCTAGGTCCCT 2 993 

IMIMMMIIIIIII IIIIMIMMIIIIIIIMMIIMIIIIIIMIIIIIIM 

Db .2376 AGCCTCGGCCTGCCCCTGGAGCCAGGCTCGGTACCGATGCGTGGGCTGGGCTAGGTCCCT 243 5 

Qy 2994 CTGTCCATCTGGGCCTTTGTATGAGCTGCATTGCCCTTGCTCACCCTGACCAAGCACACG 3 053 

IIMMMIIMIMIIMIMIMMIMIIIIMIIIIIIIMIIIIMMIMIIII 

Db 2436 CTGTCCATCTGGGCCTTTGTATGAGCTGCATTGCCCTTGCTCACCCTGACCAAGCACACG 2495 

Qy 3054 CCTCAGAGGGGCCCTCAGCCTCTCCTGAAGCCCTCTTGTGGCAAGAACTGTGGACCATGC 3113 

II III II II 1 1 Ml II I Mill I MINI lllllll 1 1 III II 1 1 III II II I II II II I 

Db 24 96 CCTCAGAGGGGCCCTCAGCCTCTCCTGAAGCCCTCTTGTGGCAAGAACTGTGGACCATGC 2555 

Qy 3114 CAGTCCCGTCTGGTTTCCATCCCACCACTCCAAGGACTGAGACTGACCTCCTCTGGTGAC 3173 

MMIMIMMMMMIMMIIIMIIMIIIMIMMIIIMMIIIIMIIMI 

Db 2556 CAGTCCCGTCTGGTTTCCATCCCACCACTCCAAGGACTGAGACTGACCTCCTCTGGTGAC 2615 

Qy 3174 ACTGGCCTAGAGCCTGACACTCTCCTAAGAGGTTCTCTCCAAGCCCCCAAATAGCTCCAG 3 233 

MIMMIII lllllllllllllllllllllllllllllllllllllllllllllllll 

Db 2616 ACTGGCCTAGGGCCTGACACTCTCCTAAGAGGTTCTCTCCAAGCCCCCAAATAGCTCCAG 2675 

Qy 3234 GCGCCCTCGGCCGCCCATCATGGTTAATTCTGTCCAACAAACACACACGGGTAGATTGCT 32 93 

IMIMMIIIMMIMMIIIMMMIMMMMMIMIMIMIMMIMMI 

Db 2676 GCGCCCTCGGCCGCCCATCATGGTTAATTCTGTCCAACAAACACACACGGGTAGATTGCT 2 735 

Qy 32 94 GGCCTGTTGTAGGTGGTAGGGACACAGATGACCGACCTGGTCACTCCTCCTGCCAACATT 3353 

IIMIIIIMIIIIIIIMIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIIMII 

Db 2736 GGCCTGTTGTAGGTGGTAGGGACACAGATGACCGACCTGGTCACTCCTCCTGCCAACATT 2795 



Qy 3354 CAGTCTGGTATGTGAGGCGTGCGTGAAGCAAGAACTCCTGGAGCTACAGGGACAGGGAGC 3413 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 2 796 CAGTCTGGTATGTGAGGCGTGCGTGAAGCAAGAACTCCTGGAGCTACAGGGACAGGGAGC 2 855 

Qy 3414 CATCATTCCTGCCTGGGAATCCTGGAAGACTTCCTGCAGGAGTCAGCGTTCAATCTTGAC 34 73 

IIIIMIIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIM 

Db 2856 CATCATTCCTGCCTGGGAATCCTGGAAGACTTCCTGCAGGAGTCAGCGTTCAATCTTGAC 2915 

Qy 34 74 CTTGAAGATGGGAAGGATGTTCTTTTTACGTACCAATTCTTTTGTCTTTTGATATTAAAA 353 3 

IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 2916 CTTGAAGATGGGAAGGATGTTCTTTTTACGTACCAATTCTTTTGTCTTTTGATATTAAAA 2 975 

Qy 3534 AGAAGTACATGTTCATTGTAGAGAATTTGGAAACTGTAGAAGAGAATCAAGAAGAAAAAT 3593 

IIIIIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 2976 AGAAGTACATGTTCATTGTAGAGAATTTGGAAACTGTAGAAGAGAATCAAGAAGAAAAAT 303 5 

Qy 3 5 94 AAAAATCAGCTGTTGTAATCACCTAGCAAAA 3 624 

iiiiiiiiiiiiiiiiiiii mill III 

Db 3 036 AAAAATCAGCTGTTGTAATCGCCTAGCCAAA 3066 



